Indicators Ratings Parallel =  11 ms
Indicators Ratings =  10 ms
CPU compute ratings: 1 iteration avg time: 2151 ms
 
 ---- 
cuda_malloc<d_ratings> 0.78
cuda_malloc<d_companies_map> 0.66
kernel<computeRatingsKernel> 1.19
kernelSegment<sort> 3.74
toHostMemory<d_ratings> 1.43
toHostMemory<d_companies_map> 1.16
ranking 2.15
TotalRankingTime 11.43

cuda_malloc<d_ratings> 0.25
cuda_malloc<d_companies_map> 0.12
kernel<computeRatingsKernel> 1.17
kernelSegment<sort> 1.1
toHostMemory<d_ratings> 1.09
toHostMemory<d_companies_map> 1.14
ranking 2.12
TotalRankingTime 7.31

cuda_malloc<d_ratings> 0.25
cuda_malloc<d_companies_map> 0.08
kernel<computeRatingsKernel> 1.16
kernelSegment<sort> 1.16
toHostMemory<d_ratings> 2.09
toHostMemory<d_companies_map> 2.07
ranking 2.93
TotalRankingTime 10.18

GRID K520 :  797.000 Mhz   (Ordinal 0)
8 SMs enabled. Compute Capability sm_30
FreeMem:   3908MB   TotalMem:   4096MB   64-bit pointers.
Mem Clock: 2500.000 Mhz x 256 bits   (160.0 GB/s)
ECC Disabled


